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ABSTRACT 

The Rasch rating scale model was used to provide 
diagnostic information about a teacher effectiveness instrument, the 
Inventory of Myself as a Professional (IMP). Two forms of the 
instrument, a self-report form to be completed by the teacher and an 
observation form to be completed ■>y a supervisor or administrator 
contained the same thirty-six items. The items were included on three 
subscales: (1) student to teacher relationships; (2) professional 
attitude; and (3) professional skills. Response patterns were scored 
on a scale from one to five. The effectiveness of 204 teachers as 
observed by their principals on the Observation Form of the IMP and 
the effectiveness of 211 teachers as reported on the Self-Report Form 
of the IMP were assessed. Data for both forms were collected for 
teachers from public elementary and secondary schools in Colorado. 
Both forms appeared to fit the Rasch model quite well. The teacher 
effectiveness variable was defined on both instruments, providing 
construct validity to the IMP. item calibrations sufficiently spread 
the person measures to ensure an adequate fit of the model on both 
forms. Having person measures contributed to the diagnostic potential 
of the IMP. Results suggest that the Rasch model provides a major 
breakthrough in analyzing the variable of teacher effectiveness in 
the use of item calibrations on a hierarchical continuum. 
Instructions for both forms of the instrument are included. A table 
and six figures illustrate the study results. (SLD) 
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SUMMARY 

The Rasch Rating Scale Model is applied to provide 
diagnostic information on a teacher effectiveness instrument 
called the Inventory of Myself as a Professional (IMP). Thirty- 
six items are included in three subscales: student teacher 
relationships, professional attitude, and professional skills. 
Items are identical on two separate forms of the instrument: a 
Self-Report Form, and an Observation Form. Response patterns are 
scored on a scale of one to five. 

Data for both instruments were collected on 211 teachers 
whose school buildings were randomly selected from the state of 
Co"''>rado public schools, grades K-12, as part of a validation 
study conducted at the University of Northern Colorado and 
endorsed by the Colorado Department of Education. 

Item and person analyses are positioned on an attitudinal 
scale in logits. Scales are analyzed separately on each form by 
the items positions at their calibrations on the attitude 
variable. Persons are ordered and compared by professional level 
elemen^ary and secondary. 

Items and persons are compared within tests and across both 
forms. The item difficulties and person measures provide insight 
into the dimension of teacher effectiveness not previously 
available in teacher assessment. Diagnostic implications of both 
parameters allows for new application of the teacher effectiveness 
dimension. 
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INTRODUCTION 

Teacher effectiveness is currently a prominent national 
issue. Soar, Medley, and Coker (1983) claim the need for better 
ways of evaluating teachers to identify qualified teaching 
personnel as urgent. Bennett (1978) suggests that the current 
methods of teacher evaluation are ineffective. Current research 
methods have not provided specific direction. This lack of 
specificity is due partly to the nature of the definition of 
teacher effectiveness and a lack of consensus in the research 
findings. 

Borich (1977) reported no scaled dia^ostic measures in 
teacher effectiveness in his revinw of teacher evaluation 
instruments from the Ryans study in the 1950s through the middle 
1970s. Several states have responded to mandated legislation for 
teacher certification with their own performanced-based 
instruments. Burry (19P4) reviewed the psychometric properties of 
these instruments and found that the reliability coefficients did 
not exceed .77 for a total instruments, and the scales of these 
instruments to be considerably lower. Validity studies were 
limited to content validity. Attempts were made to establish 
construct validity through factor analysis; however, none of the 
scales were supported by this statistical procedure. There 
appears to be a lack of consistency in what the experts in the 
field judge to be one dimension and what the statistical 
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procedures define as another. These congruencies only contribute 
to the lack of consistency and specificity needed to provide a 
clear, practical, and workable definition of teacher 
effectiveness. 

Perhaps part of the problem lies in the instruments 
themselves and the other part of the models applied in the 
validation process. At this time the literature does not report 
an effective use of the Rasch Model in validating the domain of 
teacher effectiveness. Use of the present instrument results is 
limited to pass-fail, certif ied-not certified, hire-fire, or more- 
no additional monies concepts rather than how can the teacher (s) 
develop into a more effective professional. Wright and Masters 
(1962.. p. 90), state, "The purpose of a measurement model is to 
extract from suitable data a useful definition of an intended 
variable and then to measure persons on this variable." The Rasch 
Model attempts to verify that the items on an instrument work 
together to define the variable. Applying the Rasch Model to a 
teacher effectiveness instrument would provide not only the 
validation of the teacher effectiveness dimension, but item 
calibrations and person ability measures. This information would 
be most useful in providing diagnostic information. 

INSTRUMENT DEVELOPMENT 
The IMP was created to attempt to provide substance and 
clarity t6~the definition and dimension of teacher effectiveness 
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along with diagnostic information. All of the items and the 
scales were constructed based upon the literature. The t*iree 
scales are: 1) Teacher-Student Relationships, 2) Professional 
Attitude, and 3) Teaching Skills. The two forms were developed 
(self ^report and an observation instrument) to be used by a 
supervisor or an administrator. These forms are identical with 
the exception of the preface statement. The self -report 
instrument asks the teacher to rate themselves on items prefaced 
by, " AS A CLASSROOM TEACHER ." while the Observation Form asks the 
rater to respond to the items prefaced by, " THIS CLASSROOM 
TEACHER ." 

All 36 itcr.s are of a uniform style and were constructed on 
a Svmunated Rating Scale. Scale responses are: (A) almost never 
true, (B) seldom true, (C) true about half of the time, (D) true 
most of the time, and (E) almost always true. Responses were 
converted to a numerical scale where (A) almost never true has a 
weight of 1, (B) seldom true has a weight of 2, on up to (E) 
almost always true which has a weight of 5. Item responses were 
summed into scales and a total instrument value. 

After a number of item trials and instrument revisions, a 
final validation study was done using a systematic, stratified 
sample of 150 schools systematically and randomly selected in the 
state of Colorado. There were 204 teachers who were observed by 
their building principal and 211 teachers who responded to the 
self -report instnaments. 
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RASCH RATING SCALE MODEL 
Because the response format of the IMP is a Summative Rating 
Scale, Andrich (1978a, 1978b) suggests that an appropriate model 
for the ordered response categories is the Rasch Rating Scale 
Model. He comments that in order to overcome the objections to 
the integer-scoring procedures, a response model which keeps track 
of the person's categorical responses is needed. Andrich (1978b, 
p. 581) states, "a rr.ultidimensional parametric structure is 
postulated with respect to the response categories and after the 
parameter estimation is carried out, checks on the possible 
reduction of the dimensionality of this structure are made." 
. _ According to Wright and Master (1981), the relative 
difficulties of the "steps" on a rating scale are to be governed 
by the fixed rating points accompanying the items. Assuming the 
response format is the same, these steps should be invariant. The 
pattern of the steps at a location is determined by the item's 
response set and is estimated once for the complete set of items. 
Items then differ only by their location on the variable of 
interest. 

The rating scale is identified by Andrich (1978a, 1978b) and 
can be written as the probability of a person n responding in 
category x to item i. 



^ o -, ' ®^ ^ [B„-(5i+X3)] 
» , Rating Scale tl, = s-o ' 
x-u,i,...,in "^ix 



S exp E [B„-(5'^+t3)l 
where t:o=0 so that exp E [B„-(5i+X3)] = i. 
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When a rating scale is analyzed using this model, Wright and 
Master (1981) say that it is possible to obtain an estimate for 
each person n, and estimated scale value for each item and 
estimates of the response "thresholds" _,Xa, Xz, . . . for m + 1 
rating categories. Because these kinds of information are 
invaluable for utilizing teacher effectiveness characteristics for 
dj.agnostic purpose?, the Rasch Rating Scale Model was applied to 
the IMP data on both forms of the instrument. 

IMP DATA ANALYSIS 
To determine whether a measurement model fits the data, it 
is necessary '.o find out whether or not the items in the 
instrument are working together, to define the variable, and to 
what extent the person's responses fit the expectations of the 
model. The degree of precision to which the hierarchical 
arrangement of the item calibrations expressed in logits on the 
dimension supports the teacher effectiveness variable provides 
evidence for the construct validity of the IMP. Another ii^portant 
aspect of the data analysis is to determine the validity and 
placement of each person's measure and to observe whether or not 
these persons are separated along the dimension of teacher 
effectiveness. The extent to which the item calibrations and 
persons statistically fit the model is, in turn, representative of 
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the ability of the IMP to provide a diagnost " index of teacher 
effectiveness items to spread the teachers along 
the dimension. Therefore, the results of the IMP data analysis 
using the Rasch Scaling Model will provide varying degrees of 
instrument, item, person, and variable validity. 

Verifying the Variable Teacher Effectiveness of the IMP 
The effectiveness of 204 teachers as observed by their 
principals on the Observation Form of the IMP and the 
effectiveness of 211 teachers as reported on the Self-Report Form 
of the IMP both on the same 36 items, is discussed in this section 
of the paper. Each form of the instrument will be discussed first 
separately and then compared with each other on item calibrations 
positions to define the variable teacher effectiveness. 

Table 1 presents the scale item statistics in sequence order 
for both instruments. The Item names ST, PA, and TS refer to the 
three scales of the instruments: student teacher relationships, 
professional attitude, and teaching skills. Again, the scales are 
the same for both forms of the instrument. The items are 
presented at the end of this paper. It is interesting to note 
that there is a smaller standard deviation of the items and the 
standard deviation of the fit statistic on the Observation Form. 
The mean for the error is the same. 

Figure 1 lays out the teacher effectiveness variable on the 
Observation Form. The it.-r, are presented in logits along the - 



ERIC 



9 



Defining Effectiveness 

10 

dimension for the total instrun,ent as well as each of the three 
scales. The items are positioned at their calibrations and the 
semicircles represent the standard error of each item along the 
continuum. There is little deviation in the standard error of the 
- items, see Table 1. The items positioned at the top are the 
easier characteristics (variables) in teacher effectiveness on 
Which to observe teachers. The entire continuum is represented 
between -i.O and n.5. six of these items have fit statistics 
greater than +4.0 which is of some concern. 

Note that many of the easier items are on the first scale 
which is student teacher relationships, item 17 on the 
Observation Form is, "Enjoys working with ........ 

Item 30 is, "Enjo^teractino with ........ .. 

"^^°^i^es opportuniti.. fp. ,,,^,,3 exo.H.n.o . 

-Scess." All of these item calibrations position themselves 
between -l.o and the mean 0 and they cluster around the same few 
calibration points. Scale 1 appears to be an easier scale on 
which to observe teachers, it has an item calibration mean of - 



.399. 

See Table 2. 
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TABLE 2 

LOGIT MEANS AND STANDARD DEVIATIONS 
FOR THE IMP SCALES 
Self-Report Form 

Observation Form 

Scale 1 Scale 2 Scale 3 Scale 1 Scale 2 Scale 

3 

--674 .093 .364 -.339 .092 .153 

.459 .974 .495 .247 .577 .403 

Scale 2 is a Professional Attitude. For the most part, these 
items represent the teacher's attitude toward his/her profi^ssion. 
Item 34 on the Observation Form reads, " Am committed to i-hp 
primary goal of student growth " item 24 is, " Exhibits 
businesslike o r task -oriented behavior. " The item calibration 
positions are along the entire instrument continuum and the mean 
of the item calibrations on these items is .092. This scale 
appears to be representative of the whole scale and is slightly 
more difficult for administrators to evaluate teachers. 

The third scale is Teaching skills and it has an item 
calibrations mean of .153 which is the most difficult of the three 
scales on which administrators evaluate teachers. These 16 items 
also cluster around many of the same calibration points, item 3 
on Scale 3 is, " Plans interesting lessons. " item 5 is, "Relates 



ERIC 



Defining Effectiveness 

12 

instruction ^n i,.,4. — tional ob1 .^i-i„». 

Items on the Self-Report Fon, cover nK,re of the teacher 
effectiveness variable, it is interesting to note that the sane 
iten. calibrate in a similar pattern both instr™«nts, but are 
spread out .nore-on the Self-Report For.,. The entire range is 
represented on the variable fron about -1.6 to 2.0. There is a 
large break i„ the continuum between items 25, 4, and 12 which i. 
a concern. The standard error is greater on these easier items. 
M.ere are only two items which fit statistics greater than .4.0 
and would be candidates for revision or deletion. 

The first scale on the Self-Report Form ha, an item 
calibration mean of -.674, a wider range and more breaks in the 
conunu^ than does 4 on the Observation Fo^. »gai„ this scale" 
appears to be the easiest scale on which teachers evaluate 
themselves. 

scale 2 is ,^.,re spread out along the dimension than either of 
the other two scales. It also has more breaks in the continuum 
than it does on the Observation Form. The mean of the item 
calibrations is .093 which is almost identical to the mean on the 

Observation Form and is slightly more difficult than the first 

scale. 

scale 3 lays out over the more difficult end of the continuu:„ 
With a calibrated logit mean of .364. ^e item calibrations 
position themselves without a break in the continuum. 
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Comparing the scales across forms, the item calibrations are 
similar but not identical, and none of the items change positions 
drastically. There are several items with unduly large fit 
statistics and are candidates for deletion or revision from their 
respective instruments. 

Item calibrations on both instruments are spread along the 
dimension more so on the Self -Report Form than on the Observation 
Form. There is only a slight break in the continuum on the 
Observation Form and one large break on the Self-Report Form. 
Ideally there would be no breaks and both instruments would have 
items that have calibrated positions between +2.00. Nonetheless, 
both instruments have item calibrated positions that sufficiently 
cover the dimension of teacher effectiveness. The scales position 
themselves in a similar pattern along the dimension even though 
the variance is greater on the Observation Form. The item 
calibration positions on these instruments suggest evidence for 
construct validity. Figure 3 is a graphic representation of the 
comparison of the item difficulties of the two instruments. The 
correlation coefficient is .74 with a slope of .44 and an 
intercept of .03. The relationship is Ir.near which also supports 
the construct validity of the dimension teacher effectiveness in 
both instruments. 

Identifying Person Measures Along the Teacher Effectiveness 
Variable 
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Identifying Person Measures Along the Teacher Effectiveness 
Variable 

Person measures are spread along the logit scale from 6.67 to 
-3.85 on the Observation Instrument in Figure 4 (see columns 1 and 
2). Column one lists the per.son's raw score, the number of 
persons with that raw score, the person's measure position, and 
the erfor for that particular measure. Persons more toward the 
top of the scale are observed to be more effective teachers by 
their principal, whereas persons at the bottom of the scale are 
individuals perceived by the principal as having less teacher 
effectiveness ability. It is interesting to note the distribution 
of the persons on the variable in center column. The people have 
a mean logit of 1.72 with a standard deviation of 2.07 along the 
36 items. They are spread out to form a slightly negative 
Platekurtic distribution. However, the persons are spread out 
along the variable, which is essential for the model to fit the 
data. 

There are nine persons with fit statistics greater than 4.0. 
These people are candidates for deletion on the next data 
analysis. Person errors are not exceptionally high except on the 
high extreme of the distribution. The reliability of person 
separation is .98. This means that the Rasch Scaling Model is 
suitable for these items on observed teachers using the IMP 
Observation Form. Interesting to note, the item calibration 
positions in the last column compared to the person measure 
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positions in the middle column. The item calibrations cluster 
themselves more in the center and to the less able, easier, end of 
the variable. There is much more variability among the persons 
than the items. 

Person measures on the Self-Report Form have an entirely 
different distribution (see Figure 5). These measures position 
themselves with a mean of 1.93 and a standard deviation of .87. 
The curve appears skewed positively, but is much more normal than 
the Observation Form. However, there is much less variability 
among the subjects. This is understandable since self -report data 
has more of a tendency to be less variable with higher scores than 
observation data of similar content. There are four persons with 
fit statistics greater than 4.0. Again these people would be 
candidates for deletion on the next analysis. The reliability for 
the person separation on the Self -Report Form is .90. The Rasch 
Scaling Model is also suitable for these 35 items on the Self- 
Report Form of the IMP. 

The item calibration positions are spread out more on the 
Self-Report Form than they are on the Observation Form, while the 
persons have less variability on the Self -Report Form than they do 
on the Observation Form. These data support the objectivity of 
the model to separate persons and items independently of each 
other. 
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Note the boxes around the per .on frequency distribution on 
Tables 4 and 5. The boxed person is an elementary teacher 
responsible for students in grades kindergarten through grades 5 
or 6, depending on the configuration of the school building in 
which the person teachers. The unboxed persons are secondary 
teachers. Persons in both levels appear to be evenly spread out 
along the continuum on the Obssrvation Form. The Self -Report Form 
has more elementary teachers at the high end of the variable and 
fewer at the low end of the variable. There appears to be more of 
a discrepancy between professional levels on the Self-Report Form 
than on the Observation Form. According to Burry (1SS4) the items 
were originally written to be generic for both elementary and 
secondary teachers. Further analysis of persons and items is 
needed to conclude that indeed these items are generic in that 
they are interpreted the same at both professional levels. 

Probability Curves for Five Response Categories on the IMP 
The category probability curves for both forms of the IMP are 
presented in Table 6. Responses for both instruments were (A) 
almost never true, (B) seldom true, (C) true about half the time, 
(D) true most of the time, and (E) almost always true. The curves 
with the O's represent the almost never true; the I's the seldom 
true; the 2>s true about half the time; the 3's true most of the 
time; and the 4's almost always true. The numbers are down- 
shifted one from their raw score assignment in the computer 
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program for the Rasch Rating Scale Analysis. The X axis 
represents the person's ability in logits and the Y axis is the 
probability of the person with a specific ability responding to 
each of the successive categories. For example, a person with a 
measure of -2.0 on the Obser^/aticn Fon.i would have the probability 
of being rated seldom true 50% o<: the time. A person with a 
measure on each instrument at 0.0 rfcald have a probability of 57% 
of being rated true about half th- time on the Observation Form 
and a probability of 50% of rating themselves true about half the 
time on the Self -Report Form. 

The probability curves are ve-/:;,' different for each instrument 
which endorses the fact that although the items are the same, the 
perceptions of these items in their application are very 
different; there are two separate instruments. 

SUMMARY AND CONCLUSIONS 

The major question of investigation was whether the Rasch 
Rating Scale Model fit one or both of the forms of the teacher 
effectiveness instrument The Inventory of Myself as a Professional 
(IMP). Because the data in the literature in its current state of 
the art do not utilize the use of the Rasch Rating Scale Model, it 
is essential to answer this question first. It seems that both 
forms of the instrument appear to fit the model quite well. For 
the most part, item calibrations on both instruments had a miniimim 
of standard error and only a few items with exceptionally high fit 
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statistics. The items on both instruments sufficiently spread the 
persons along the variable teacher effectiveness. Only a few 
persons had extremely high fit statistics above 4,000. The 
reliability coefficient of person separation is .98 on the 
Observation Form a-.d .90 on the Self -Report Form. It appears that 
the Rasch Rating Scale Model is most appropriate for analyzing the 
IMP on both instruments. 

Other questions of investigation ask to what degree is the 
variable teacher effectiveness validated by the use of the model? 
Do the item calibrations and the person measures work together to 
define the variable on both forms or the instrument? Does the 
hierarchical arrangement of the item calibrations on the teacher 
effectiveness variable separate the persons along the same 
variable? 

The variable teacher effectiveness is defined in a small 
range from -.72 to 1.26 on the Observation Form by the 
hierarchical positions of the item calibrations. There is only 1 
small break at 1.0 on the continuum. The teacher effectiveness 
variable is defined. However, the definition could be 
strengthened by adding items with calibrated positions at both 
ends of the variable. The range on the Salf -Report Form is wider 
from- -1.32 to 1.91. Again, the variable is defined; however it 
could be strengthened by adding items with calibrated positions at 
this spread and the easier end of the continuum. The teacher- 
effectiveness variible is defined on both instruments and thus 
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providing construct validity to the IMP. 

The item calibration positions spread out along the variable 
with small standard errors of measurement. Both the Self -Report 
and the Observation Forms of the IMP provide a definition of the 
teacher effectiveness variable. Tho scales on both instruments 
position themselves in the same order of degree of difficulty on 
the item calibrations. This is of significance to teacher 
education programs in that the student teacher relationship scale 
is the easiest, the professional attitude scale is in the middle 
position, and for the most part, the most difficult items are the 
teacher skills. The teaching skills can and are taught in most 
teacher education programs. However, until this analysis was 
complete there has been no confirmation that these skills were any 
harder to perform or self-analyze than any other kind of skills in 
teacher effectiveness. This is an intriguing discovery. 

The item calibrations on both instruments spread out the 
person measures with a minimum of error and misfitted the persons. 
The -distributions are different; the Observation Form has a much 
greater variance on the person measures than does the self-report 
instrument. However, both populations are adequately spread out 
by the item calibrations to obtain adequate person measures. As 
Wright and Masters (1981) comment, with the use of the Rasch 
Rating Scale Model it is possible to obtain interval measurements 
of persons' responses to a Summative Rating Scale. This is a 
major contribution and improvement to the field of teacher 
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evaluation. 

The analysis of the category probability plots supports the 
use of each instrument as a separate instruinent, as the response 
patterns are very different. Burry (1984) concludes that little 
has been done to validate the use of the san.e items with two 
different applications. 

To-sunmiarize the results the Rasch Rating Scale Model fits 
both the observation and Self-Report Form of the IMP. The item 
calibrations sufficiently spread the person measures to ensure an 
adequate fit of the model on both forms. The variable teacher 
effectiveness is defined with some limitations. 

IMr_ther investigations are needed to analyze the data across 
teaching levels, item calibrations for each level would provide 
more insight into how teachers at each level view each item of 
teacher effectiveness and whether or not the dimensions of the 
scales change or remain constant. 

In all, the Rasch Rating Scale Model provides a major 
breakthrough in analyzing the variable of teacher effectiveness in 
the use of items calibrations on a hierarchical continuum. Having 
person measures contributes to the diagnostic potential of the IMP 
and other similar scales. Instruments with good psychometric 
properties can only facilitate and enhance the potential and 
application of the teacher effectiveness variable in training and 
assisting the development of effective professional teachers. 
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A. ALMOST WEVER TRUE tk« 

professional classrolm ceachar"' ^"""^ ^"e 

true. iassroom teacher says or does is almost never 
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„ half of the time. "acher says or does is true about 

D. TRUE MOST OF THE TI."E th- . 

professional clasLol tL'c%e""Irror1°"^ 

crue. <:«ner says or does is almost nevr 

E. ALMOST ALWAYS TRUE th« 
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CO provide feedback for .our^^Si^^d^^^x J^of e^^^o^-x^^^JroSL;^ 

cL%'sro'o«':eac"e%'':n%"'L preface'd'wich "As"."^?!.: ^"""''^^ « ^ 
■ • ." Respond Co each s?acemenc lilh \CLASSROOM TEACHER. I 

II J;sc;i^:^^-3^^^f-- - t^^i^ 

" a»Tlxa«fI?oM«?fac;Sr llr^l ^^^^ ^ ^ - 

" fSoir^eacSJ: 1^ or do as a 

C. TRUE ABOUT HALF OF m WME occasionally. 

„ do as a classroo^teacher 1, ? "acenenc abouc whac I 

D. TRUE MOST OF THE TIME The tLl ^^^^ cin,e. 
^ do as a classroon ceachlJ is err*"' ^ 

E- ALMOST ALWAYS TRUE Th^ Ir °^ ^ime. 

do as a classroSf celJLrirSsf :i^!ayT^Je"^ " 

wh^!ch"L^r%^^::"i^''■"^='^^ =aref.lly.- Selecc 
go-olecel. blacken Che lecJer^hL'h " " "•I'^^roon. c„cher 

ITo noc -Ju?r-H7cle the l^ccer B^ackr"'"'?" ^"""^ 
scacemenc. ^eccer. Blacken only 1 leccer for each 

PLEASE USE A NUMBER 2 PENCIL 
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TABLE 1 



SEI.P REPORT FORM 
SERIAL ORDER 



SCALE ITEM STATISTICS 
SERIAL ORDER 



SEO ITEM 
NUM NAME VALUE ERROR 



SFIT 





ST01 


-0.33 


0.11 


-2.24* 


2 


PA01 


0.70 


0.09 


-2. 10* 


3 


TS01 


0.66 


0.09 


-4.98* 


4 


TS02 


1 .07 


0.08 


-0.98 


5 


TS03 


-0.28 


0.11 


-1.21 


6 


CT02 


-0.46 


0. 12 


0.26 


7 


TS04 


0.01 


0. 11 


3.31* 


8 


ST03 


-0.04 


0.11 


-1.05 


9 


PA02 


0.90 


0.09 


4.65* 


10 


TS05 


-0.55 


0.12 


-1.03 


11 


TS06 


0.92 


0.09 


-0.20 . 


12 


PA03 


1.91 


0.08 


4.55* 


13 


TS07 


-0.23 


0.11 


-4.01* 


14 


ST03. 


-0.96 


0. 14 


-1.49 


15 


PA04 


-0.77 


0.13 


-2.62* 


16 


TS09 


0.23 


0.10 


-5. 17* 


17 


ST05 


-1. 15 


0.14 


-2.26* 


18 


PA05 


0.59 


0.09 


3.22* 


19 


TS09 


0.47 


0. 10 


2.75* 


20 


PA06 


-0.63 


0.12 


-2.61* 


21 


TS10 


0.25 


0.10 


-1.05 


22 


TS 1 1 








23 


ST06 


-0.58 


0. 12 


-3.82* 


24 


PA07 


-0.05 


0.11 


-0. 16 


25 


TS12 


1.07 


0.08 


0.37 


26 


TS13 


0.78 


0.09 


-3.08* 


27 


ST07 


-0.40 


0. 12 


-^.20* 


28 


PA03 


-0.59 


0.12 


-1. 15 


29 


ST08 


-1.2G 


0.15 


-0.56 


30 


ST09 


-1.32 


0.15 


-0.58 


31 


PA09 


0.26 


0. 10 


-5.61* 


32 


TS14 


0.29 


0. 10 


0. 14 


33 


ST07 


-0.23 


0. 11 


-3.03* 


34 


PA 10 


-1 .39 


0. 16 


-1.69 


35 


ST09'i> 


0. 10 


0. 1{ 


2.67* 


36 


TS16 


0. 19 


0. 10 


0.28 



OBSERVATION FOPxM 
SERIAL ORDER 




1 ST01 

2 PA01 
TS01 
TS02 
TS03 
ST02 
TS04 

8 ST03 

9 PA02 

10 TS05 

11 TS06 

12 PA03 

13 TS07 

14 ST03 

15 PA04 

16 TS09 

17 ST05 

18 PA05 

19 TS09 

20 PA06 

21 TS10 

22 .Tsn 

23 ST06 

24 PA07 
TS12 

26 TS13 

27 ST07 

28 PA08 

29 ST08 

30 ST09 
PA09 

32 TS14 

33 ST07 

34 PA 10 

35 ST09TS 

36 TS16 



-0. 19 
0.47 
0.06 
1.13 
0.02 
-0.37 
-0.20 
0.04 
0.66 
-0.25 
0.53 
1.26 
-0.47 
-0.38 
-0.30 
-0.02 
-0.72 
0.08 
-0. 10 
-0.34 
0.18 
0.21 
-0.60 
-0.32 
0.42 
0.77 
-0. 17 
-0. 10 
-0. 18 
-0.64 
0.20 
0.23 
-0. 18 
-0.69 
-0. 12 
0.06 



0. 11 
0. 11 
0. 11 
0. 10 
0. 11 
0. 11 
0. 11 
0. 11 
0.11 
0. 11 
0. M 
0. 10 
0. 11 
11 
11 
11 
12 
11 
11 
M 
11 
11 
t2 
11 
11 
11 
11 
11 



0. 11 
0. 12 
0. 11 
0. 11 
0.11 
0. 12 
0. 11 
0.11 



0.60 
-1.20 
-3.89* 

1.84 
-0.99 
-2.62* 
2.62* 
-0.93 
0.94 
0.90 
2.28* 
5. 16* 
-2.70* 
-2.01* 
-2.80* 
-1.59 
-3.24* 
3.64* 
0.62 
-0.60 
-1.88 

3.12* 
-3.09* 
2.07* 
0.86 
-2.03* 
-4.34* 

0.77 
-2.82* 
-0.68 
-3.55* 
-1.13 
-3.04* 
-2.94* 
-0.51 
-3.30* 



SUMMARY STATISTICS FOR SCALE ANALYSIS 



MEAN 
SO 



VALUE ERROR SFIT 



-0.00 
0.77 



0.11 
0.01 



-0.75 
2.98 



MEAN 
SO 



VALUE ERROR SFIT 



-0.00 
0.47 



0.11 -0.73 
0.0 2.38 




25 



Column 1 
Distribution of all 
36 Items and Their 
Standard Errors 

-2.0 r 



1.0 T 



2.0 



^*o%±t Scale 
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FIGURE 1 



DEFINING THE IMP VARIABLE BY ITEM DIFFICULTY 
OBSERVATION FORM ™^ 
Column 3 
Distribution of 
Scale 2 Items and 
Their Standard Errors 



Distribution of 
Scale 1 Items and 
Their Standard Errors 



-2.0 



-1.0-- 




1.0 



2.0 

Logit Scale 



-2. or 



-1.3-- 




2.0 

Logit Scale 
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Column 4 
Distribution of 
Scale 3 Items and 
Their Standard Errc 

-2.0" 




-7 




Column 1 
Distribution of all 
36 Items and Their ' 
Standard Errors 

-2.(h- 







1.0 
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FIGURE 2 

L'^FINING THE IMP VARIABLE BY ITEM DIFFICULTY 
SELF-REPORT FORM 



Column 2 
Distribution of 
Scale 1 Items and 
Their Standard Errors 

-2.0^ 



1.0 - 



2.0 

Logit Scale 
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Column 3 
Distribution of 
Scale 2 Items and 
Their Standard Errors 



-2.0t- 





®-> f) 
1.0 



© -^j 
2.0 ^ 

Logit Scale 
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Column 4 
Distribution of 
Scale 3 Items and 
Their Standard Er: 

-2.0 r 



-1.0 




2.0 

Logit Scale 
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FIGURE 3 

COMPARING ITEM DIFFICULTIES OF 
THE IMP OBSERVATION AND SELF-REPORT 
FORMS 

36 ITEMS 



2- 



loglts 
Observation 




0 1 2 

logits 
Self-Report 
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FIGURE 4 

SCALE MAP SHOWING POSITIONS OF PEOPLE 'aND ITEMS ON THE VARIABLE 

OBSERVATION JQ^ VARIABLE 
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i!!!?!!?^""** ERROR 



PeOPl.E(K<* 204) 



ITEM 



6.67 t.oi 



5) 5.S5 0.73 



«4t( O) 5,52 0.59 



MOt 3) 5.20 0.52* 
t39( 3) 4.95 0.48 



f38( 
f37( 



2) 
3) 



4.74 
4.55 



0.44 

0.41 



f3S( 2) 4.39 0.39 

13S( 8) 

134( 2) 

f33( I) 

f32( 2) 

t3t( 5) 

f30( 2) 

129( 1) 

128( 2) 

t27( 3) 

126( 3) 



125( 3) 
124( I) 
123( 4) 
131( 1) 
t20( 3) 
119( 3) 
t17( 4) 
1f6( a) 
1t4( 4) 
n3( 1) 
5) 

110( 3) 
f08( 3J 
107( 1) 
10S( S) 
103( 4) 
102( 2) 
100( 8) 
99( 3) 
97( 2) 
95( 6) 
94( O) 
92( 2) 
90( 4) 
88( 4) 
87( 5) 
85( 5) 
83( 7) 
8t( 3) 
79( 3) 
78( O) 
76( 3) 
74( 1) 
72( 6) 
70C 5) 
68C 2) 
66( 6) 
S3( 2) 
63( 1) 
61( 4) 
59( O) 
57( 3) 
55( 1) 
53( O) 
51( 2) 
49( 2) 
*7i f) 
45C 6) 
43( f) 
*1( 2) 
39( 2) 
37( O) 
35( O) 
34( o) 
32( O) 
30( O) 
28( O) 
27( O) 
25( 1) 
23( O) 
22( O) 
21( O) 
I9( 1) 
. f«( O) 
f7( o) 



5) 

W O) 



<.24 
4.tt 
3.98 
3.a7 
3.76 
3.65 
3. 55 
3.46 
3.37 
3.28 
3.19 
3.11 
3.03 
2.87 
2.79 
2.72 
2.57 
2.50 
2.37 
2.30 
2.t6 
2. to 
t.97 
1.S0 
1.78 
1.65 
1.59 
t.47 
1.41 
t.29 
1. 17 
1.11 
0.99 
0.87 
0.76 
0.70 
0.59 
0.48 
0.36 
0.25 
0.20 
0.09 
-O.02 
-0. 13 
-0.24 
-0.34 
-0.45 
-0.50 
-0.61 
-0.71 
-0.81 
-0.91 
-t.02 
-1. 12 
-1.22 
-1.32 
-1.42 
-t.52 
-1.63 
-1.73 
-1.83 
-1.94 
-2.04 
-2. 10 
-2.21 
-2.32 
•2.44 
-2.50 
-2.62 
-2.75 
-2.81 
-2.88 
-3.02 
-3. 10 
-3. 18 
-3.34 
-3.43 
-3. S3 
-3.63 
-3.73 
-3.85 



0.37 
0.36 
0.35 
0.34 
0.-33 
0.32 
0.31 
0.31 
0.30 
0.30 
0.29 
0.29 
0.28 
0.28 
0.27 
0.27 
0.27 
0.27 
0.26 
0.26 
0.26 
0.26 ' 
0.25 
0.2S 
0.25 
0.25 
0.25 
0.25 
0.25 
0.24. 
0.24 
0.24 
0.24 
0.24 
0.24 
0.24 
0.24 
0.24 
0.24 
0.24 
0.23 
0.23 I 
0.23 
0.23 
0.23 
0.23 
0.23 
0.23 
0.23 
0.23 
0.23 
0.23 
0.23 
0.23 
0.23 
0.23 
0.23 
0.23 
0.23 
0.23 
0.23 
0.23 
0.23 
0.23 
0.24 
0.24 
0.24 
0.24 
.25 



O. 
O. 

0.26 
.26 



.26 



O. 
O. 

0.28 
0.28 
0 
O 
O 

0.32 

.33 



.27 



.29 
.30 
.31 



xxxx 

XX 




ixicx 
xxxxxx 

XX XXX 
XX 

gbcxx 



X 
XX 

ISocxxx 



PA03 
»'S02 



TS13 
PA02 



PA01 TS06 
TS12 

TStO TS11 PA09 TS14 

TS01 PAOS TS16 

TS03 ST03 TS09 

TS09 PA08 ST09 

ST01 TS04 ST07 ST08 ST07 

TS05 PA04 PA06 PA07 

ST02 ST03 

TS07 

ST06 ST09 
STOS PA 10 



1.26(.tO) 5 
1.t3(.10) 2 



0.77(.11)-a 
0.66(.t1) 1 

0.53(.11)-1 2 
0.42C.11) 1 

0.23(.t1)-.2 3-4- t 
O.C6(. 11)«4 4-3 
-0.02(. 11)-1-l-a 
-0.12(.11) 1 1-.1 
-0.18(.11) 1 3-4-3-3 

-o.32(.it) i-a.t a 

-0.38C. 11)-3-2 
-0.47(,11)-3 
-0.64(.ia)-3-i 
-C.69(.12)-3-3 



O. 

0.34 



be 
III 



9^ 



« 

peRsoM QCALE WAP 

(FRCO) POSITIOW CRftioR 



FIGURE 5 

CT^OWING POSITIONS OF PEOPLET AND ITEt^ GIT TliZ VAT.IABLE 
SttUWiA cKL?r£5i*'^:^T PORK 



cKL?r£5i*'^:^T PORK. 



9b 



O) 5.73 



1) 5.03 



f .Of 



0.73 



a) 4.59 0.39 

<<0( 1) 4.i7 0.32 

«3( 2) 4.02 0.48 

t3«( O) 3.8i 0.44 



137( 
f36( 



3) 



3.62 
3.46 



0.41 
0.39 



t35( a) 

t34( t) 
133( 3) 

132; 3) 
13t( 6) 
130( 7) 
t29( 9) 
127( 15) 
f2«( 4) 
t25( 3) 
«24( 5) 
122( 11) 
t21( 8) 
119( 8) 
118( 10) 
116( 12) 
1U( 18) 
113{ 8) 
9) 

109( 7) 
t07( 8) 
f05( 4) 
102 ( 10) 
100( 11) 
98( 2) 
95( 4) 
93( 4) 
90( 2) 



3.31 0.37 
3.18 0.36 
9.06 0.34 



2.94 

2.83 

2.73 

2.63 

2.45 

2.37 

2.29 

2.21 

2.07 

2.00 

1.86 

1.89 

1.67 

1.S6 

1.50 

1.39 

1.29 

1.19 

1.09 



m 1) 

ttS( O) 

82( 2) 

79 ( O) 

76( O) 

73 ( O) 

70( 0) 

67( O) 

64( O) 

61( O) 

57( O) 

54( O) 

51( O) 

48( O) 

45( O) 

42( O) 

39( O) 

37( O) 

34 ( O) 

31( O) 

29( O) 

27( O) 

24( O) 

22( O) 

20( O) 

19( O) 

17( O) 

«( O) 

14( O) 

13( O) 

12( O) 

10( O) 

9( O) 



0.93 
0.86 
0.78 
0.66 
0.S8 
0.46 
0.39 
0.28 
0.17 
0.07 
-O.03 
-0.13 
-0.23 
-0.33 
-0.42 
-0.52 
-0.65 
-0.74 
-0.84 
-0.94 
-1-04 
-1.14 
-1.25 
-1.32 
-1.43 
-1 .55 
-1 .63 
-1.71 
-1 .85 
-1.95 
-2.03 
-2.10 
-2.22 
-2.35 
-2.42 
-2.49 
-2.57 
-2-74 
-2.84 



0.33 
0.32 
0.31 
0.31 
0.29 
0.29 
0.28 
0.28 
0.27 
0.26 
0.25 
0.25 
0.24 
0.24 
0.24 
0.23 
0.23 
0.22 
0.22 
0.21 
0.21 
0.21 
0.20 
0.20 
0.19 
0.19 
9. 19 
0.19 
0.18 
0.18 
0.18 
0.18 
0.18 
0. 18 
0.18 
0.18 
0. 18 
0.18 
0.18 
0.18 
0.19 
0.19 
0.19 
0.20 
0.20 
0.20 
0.21 
0.22 
0.22 
0.23 
0.24 
0.25 
0.26 
0.27 
0.27 
0.28 
0.31 
0.32 



8( O) -2.05 0.34 

7( O) -3.08 0.36 

6( O) -3.22 0.39 

*0)''^'.39 0.43 

4( O) -3.60 0.47 

3( O) -3.87 .0.36 

2i O) -4.23 0.69 
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^ &x 
jgpcx xx 

©XX 
_ XXX 

, Hocxxx 

SS^noQciOrxxxx 
txwwxxxx 

_ XXXXXXXX PA03 

..JSKSfxxxxx 

fflOJfflfflSXXXXX 

^Skxxxxxxxxxxxx 
Usexxxx 
xxxxxxxxx 
,^®cxxxx 
gocoocxx 

. 

IXSJSSSXXXXX 



XX 

xxxx 
xxxx 

XX 



TS02 TS12 

PA02 TS06 TS11 
TS13 

PA01 TS01 
PA05 

TS09 

PA09 TS14 
"TS09 TSIO TS16 
ST09 

TS04 ST03 
PA07 

TS07 ST07 
ST01 TS03 
ST07 . 
ST02* TS05 
PA06 ST06 PA08 

PA04 

ST03 
ST03 

ST08 ST09 
PA 10 



r^^— 0>— '4v91— ItOi- 



ITEM 

VALU6 sriT 



1.91(.08) 5 



1.07(.08)-1 O 

0.85(.09) 5 0 8 
0.78(.09)-3 
0.66(.09)-2-5 
0.59(.09) 3 
0.47(.10) 3 

0.29(.10)-6 O 
0.19(.10)-5-1 O 
O.IO(.IO) 3 
-O.04(.11) 3-1 
-0.05(.11) O 
-0.23(.11)-4-3 
-0.28(.11)-2-l 
-0.40(.12)-4 
-0.55(.12) 0-1 
-0.59(. 12)-3-4-l 

-0.77(. 13)-3 

-0.96(. 14)-1 

-1.15(.14)-2 

-t.32(.15)-1-1 

-t.39(.16)-2 



0 



FIGURE 6 



CATEGORY PROBABILITY CURVES FOR THE IMP 



OBSERVATION FORM 



coNTosiii Pitcsiiiiiif cumt to* mi mm 



. SEJ<F^RE?ORT FORM 



, cttwosiii rjoiA«iiiiv cvtnt iM ui iiiHs 



r O.flO 



ff 0.4O 



O II II 



0 a 

t 0 



S ] 

) 1 



I > 
I I 

IS 
S I 

1 I 

1 I 



000 *** III 



9 4 
I * 
» * 

• 3 

« II 



) "OOOOOOOOOOOOOOOOOO 1 1 1 1 1 1 1 1 1 1 1 



111 

lllllll 
lllllllllllllllllll 

1.00 «,< 



Ilil|»«i0 VUVI -3.M -l.lt 0.«] 3.00 



ff 0.40 



III 



33 : 

31 33 



113 



--"1111— — 

33 33 
S S 33 

> t 

a 1 s 
1 a « 

II s t 
0 I ] 
13 

0 3i 

» I 4 
» I 4 
9 0 1 4 
So II 4 
0 M 
0 4411 
Q4t II 
44 0» II 

oooo 



4 3 
4 j"' 



33 



33 



M33 



lltl 

tllll 3113 I 

• 1,00 .Tm «»«5«»»««'«««IIIMMIIIIII|||||||3II3III 

***~ •I.OO 1.00 s.oo 

tmificHO vuM -1.03 -o.tr 0*01** i.*i 



204 People (M = 1.72, S = 2.07) 36 Items 
Max. of- 5 Categories 144 Steps 



211 People (M = 1,93, S = 0.87) 36 Items 
Max. of 5 Categories 144 Steps 



